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ABSTRACT 

With the rapid development of digital technologies, the analysis and 
processing of data has become an important problem. In particular, 
classification, clustering and processing of complex and multi- 
structured data required the development of new algorithms. In this 
process, Deep Learning solutions for solving Big Data problems are 
emerging. Deep Learning can be described as an advanced variant of 
artificial neural networks. Deep Learning algorithms are commonly 
used in healthcare, facial and voice recognition, defense, security and 
autonomous vehicles. Image processing is one of the most common 
applications of Deep Learning. Deep Learning software is commonly 
used to capture and process images by removing the errors. Image 
processing methods are used in many fields such as medicine, 
radiology, military industry, face recognition, security systems, 
transportation, astronomy and photography. In this study, current 
Deep Learning algorithms are investigated and their relationship with 
commonly used software in the field of image processing is 
determined. 
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In recent years, the use of Deep Learning mechanisms 
and techniques in image processing applications has 
become popular. The availability of powerful 
computing environments on the Internet [1]; and the 
fact that cloud providers offer ready-made machine 
learning and artificial intelligence laboratories to 
researchers [2] have accelerated these efforts. 


The concept of machine learning has been used since 
the 1990s. But it has not evolved for many years. The 
reason is that the computers are not powerful enough, 
the data sets are small, the wrong installation and the 
use of wrong activation functions [3]. The increase in 
computer power, fast access to internet; and 
increasing technological capabilities in cloud 
environments have increased the interest in machine 
learning and deep learning approaches in recent years. 


Deep learning is a machine learning method that 
predicts the results of a given dataset and its structure 
consists of more than one artificial neural network. It 
can be termed as a subfield of machine learning. 
Researchers use Deep Learning software to analyze 
complex and large data sets and process image, text, 


and audio data more accurately and quickly. 
Behaviors that can be perceived by humans, such as 
smart home devices perceiving commands, 
autonomous vehicles distinguishing pedestrians, 
recognizing fresh and spoiled food, are among the 
topics of Deep Learning and successful results are 
obtained. 


Figure 1 Neural network structure 


Nowadays, there are many different neural network 
architectures designed for different purposes. The 
simple architecture of a neural network is shown in 
Figure 1. The names of these architectures are defined 
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by the type of layers used and the way the layers are 
interconnected. 


The deep learning algorithm Convolutional Neural 
Network (CNN) is still widely used for object 
recognition [4, 5]. CNN has shown successful results 
in image processing applications such as image 
segmentation and classification [6]. As an example, 
consider the image of a bird in Figure 2. To determine 
whether it is really a bird or some other object, send 
the pixels of the image as arrays to the input layer of 
the neural network (multilayer networks for object 
classification). Hidden layers implement various 
computational methods and _ perform feature 
extraction by making changes. Convolutional layer 
performs the process of feature extraction from the 
image sent to the system. As the last layer, there is a 
fully connected layer that defines the object in the 
image. 
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Figure 2 Convolutional Neural Network to 
identify the image of a bird 


Convolutional Neural Networks are inspired by 
biology. CNN consists of three basic layers: 
Convolutional Layer, Pooling Layer and Fully- 
connected Layer. Any number of Convolutional and 
Pooling layers can be applied sequentially. Then the 
fully-connected layer is used. If the classification 
problem is to be solved with multiple labels, the 
softmax layer is used as the last layer. In the fully 
connected layer, the three dimensional input is 
reduced to one dimension [7]. In the literature, there 
are many deep learning models that use 
Convolutional Neural Network (CNN). The most 
commonly used CNN models include LeNet, 
AlexNet, ZFNet, VGGNet, GoogLeNet and ResNet. 


LeNet uses a 5-layer CNN. It was developed in 1998 
to classify handwriting on bank checks. This model 
uses mean pooling method to reduce the size [8]. 
AlexNet consists of 25 layers. It uses max pooling 
and softmax activation function. Due to the pooling 
layer, it is similar to LeNet [9]. ZFNet has a similar 
structure to AlexNet. In this model, the filter sizes in 
the first layer were changed and the object detection 
was improved. Moreover, a new technique called 
Deconvolution Network was developed in this model 
and the success rate was increased [10]. VGGNet is a 
CNN model supported by a graphics processing unit 


(GPU). In this model, the double or triple 
convolutional layers are followed by the pooling 
layer. Although the number of layers is high in this 
model, the data size decreases from input to output 
[11]. GoogLeNet is a CNN model with 144 layers. 
This model follows a different sequential approach 
using parallel network segments [12]. ResNet consists 
of 152 CNN layers. It has a deeper structure 
compared to other CNN architectures [13]. 


I. DEEP LEARNING APPLICATIONS 
There are many software/libraries developed for Deep 
Learning. In this part of the article, you will find 
information about the most commonly used software 
and its features. 


II. THEANO 

It was developed by the Université de Montréal in 
Python. Of course, it uses the NumPy library of the 
Python language. It has GPU support. With GPU 
support, it can perform operations 140 times faster 
than CPU. It can also perform mathematical 
calculations using KERAS and BLOCKS 
applications. It can effectively define mathematical 
expressions, including multi-dimensional arrays, 
enabling optimization and evaluation. It can run ona 
variety of operating systems. It can be pre-trained by 
Lasagne's Model Zoo. It is capable of running on 
multiple nodes. It has extensive unit testing and self- 
validation features. 


I... TENSORFLOW 

Written in C++ and Python by the Google Brain 
team. There is support for Linux, Windows and Mac 
operating systems OS. It has GPU support. It is a free 
and open source software library. It focuses on neural 
network training and inference. It can perform its 
computations on TPUs and GPUs. Today, it supports 
Python as well as many other languages such as C++, 
Java, C#, Javascript, and R. It is widely used. The 
reason for its popularity is that TensorFlow libraries 
are prepared for different platforms. It consists of a 
large library prepared for mobile apps, loT apps, web 
apps, and artificial intelligence apps. Developers use 
TensorFlow for artificial intelligence algorithms that 
are used on the most popular mobile devices 


IL. CAFFE 

It was developed by the Berkeley Vision and 
Learning Center in the C++ software language. It has 
support for multiple platforms. It has GPU support. It 
encourages development with an extensible code 
structure. It provides good solutions for research 
experiments and industrial applications. It supports 
deep learning algorithms for image classification and 
image segmentation. With an Nvidia K40 GPU, it can 
process over 60 million images per day. Caffe's 
accessible fast ConvNet implementation. Caffe 
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software supports CNN, RCNN, LSTM and fully 
connected neural networks. 


HI. IMAGE PROCESSING APPLICATIONS 
This section will describe two commonly used image 
processing applications. 


Iii... YOLO 

It is an algorithm that has been widely used in recent 
years for computer recognition of objects. Its most 
important feature is real-time object recognition. The 
general average accuracy (mAP) values used in object 
recognition are widely used because they are better 
than others. In Yolo algorithm, a single neural 
network is applied to the image. The network divides 
the image into regions. It estimates the bounding 
boxes and probabilities for each region and the 
bounding boxes are weighted by the probabilities 
determined. Yolo looks at the entire image once 
during the test. Unlike the R-CNN algorithm, Yolo 
makes predictions with a single network evaluation. 
This way of working makes it 1000 times faster than 
the R-CNN algorithm and 100 times faster than the 
Fast R-CNN algorithm [14]. 


HL. SSD 

SSD algorithm is used for real-time object detection. 
SSD speeds up the process as the region does not 
know the bid network. However, it uses optimization 
methods such as multiscaling and standard boxes to 
improve the accuracy drop. With these improvements, 
SSD tries to match the accuracy of Faster R-CNN on 
low resolution images and further increase the speed. 
The SSD algorithm consists of 2 stages. In the first 
stage, feature maps are created. In the second stage, 
convolutional filters are applied to detect objects. 


IV. RELATED WORKS 

There are many articles in the literature about the 
LeNet algorithm used for recognizing objects and 
scripts. Recognition of Arabic letters and numbers 
using LeNet-5 [15], recognition of gasses for 
electronic noses [16], recognition of road signs using 
the optimized LeNet5 algorithm [17], recognition of 
Covid-19 disease from CT images using LeNet-5 
architecture [18] has been observed. 


While examining the studies using AlexNet Deep 
Learning model, high performance was observed in 
vegetable classification [19], a review of AlexNet, 
AlexNet and VGG-16 for ear detection [20], a 
proposed AlexNet architecture for diabetic 
retinopathy image classification [21], AlexNet and 
VGG architecture using non-foldable layers. There 
are studies such as comparison [22]. The ZFNet deep 
learning model can be considered as a version of the 
AlexNet algorithm. It has been used in areas such as 
detection of defects in corn plants [23], detection of 


fungal diseases in plants [24], classification of fruits 
sold in retail stores [25], and diagnosis of Covid 19 
disease using Deep Learning [26]. The VGGNet 
algorithm has been used in_ painting style 
classification [27], chicken disease identification [28], 
motion detection [20], fish species classification [29], 
single shot multiple object detection [30]. GoogLeNet 
algorithm has been used to test the reliability of CNN 
on GPUs [31], for face recognition and classification 
[32], for artificial intelligence based classification of 
clothing [33], for Covid-19 disease detection [34]. 


Theano software has been used by researchers for 
machine learning applications in finance [35] and for 
water vapor removal applications in TDS data [36]. 
Tensor Flow is the most widely used library for 
image processing software. It has been used for static 
number crunching [37], X-ray image classification 
[38], and machine learning in business and finance 
[39]. The Yolo image processing algorithm is used to 
solve many image processing problems, such as one- 
step object detection, detection of thermal objects in 
harsh weather conditions, accurate detection of apple 
blossoms in natural environments, detection of 
medical masks in the fight against Covid-19, and 
detection of hazards on sidewalks. 


V. CONCLUSION 

Deep learning algorithms have been used for many 
years. The acceleration of computer processors has 
made deep learning algorithms popular again. The 
method, which initially started with artificial neural 
networks, produces effective results as the number of 
layers is increased and the manipulations within the 
layers are differentiated. A foldable neural network 
model that can be created for one problem may not be 
a suitable solution for another problem. Deep learning 
models are commonly used to solve image processing 
problems. Libraries created for image processing 
make problem solving very easy. Libraries created for 
image processing are constantly being updated and 
given new capabilities. Problems that could not be 
solved with the first versions of the libraries are 
solved faster with the new versions 
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